PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID CA04g16620
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; lamiids; Solanales; Solanaceae; Solanoideae; Capsiceae; Capsicum
Family HD-ZIP
Protein Properties Length: 195aa    MW: 22426.4 Da    PI: 8.9277
Description HD-ZIP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
CA04g16620genomePEPView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Homeobox57.81.8e-1868122256
                 T--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
    Homeobox   2 rkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                 rk+ ++tkeq ++Le+ F+++ +++ +++++LA++l+L  rqV vWFqNrRa+ k
  CA04g16620  68 RKKLRLTKEQSDVLEDSFKEHTTLNSKQKRDLARRLSLRPRQVEVWFQNRRARTK 122
                 788899***********************************************98 PP

2HD-ZIP_I/II121.25.2e-3968156190
  HD-ZIP_I/II   1 ekkrrlskeqvklLEesFeeeekLeperKvelareLglqprqvavWFqnrRARtktkqlEkdyeaLkraydalkeenerLekeveeLree 90 
                  +kk+rl+keq+ +LE+sF+e+++L++++K +lar+L l+prqv+vWFqnrRARtk+kq+E+d+e L+++y+ lkeen+rL+ke +eL+ +
   CA04g16620  68 RKKLRLTKEQSDVLEDSFKEHTTLNSKQKRDLARRLSLRPRQVEVWFQNRRARTKLKQTEVDCEILRKCYEDLKEENRRLNKEIQELK-S 156
                  69*************************************************************************************9.4 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SuperFamilySSF466892.63E-1856125IPR009057Homeodomain-like
PROSITE profilePS5007117.03864124IPR001356Homeobox domain
SMARTSM003895.5E-1666128IPR001356Homeobox domain
CDDcd000861.58E-1368125No hitNo description
Gene3DG3DSA:1.10.10.606.0E-1868122IPR009057Homeodomain-like
PfamPF000467.0E-1668122IPR001356Homeobox domain
PROSITE patternPS00027099122IPR017970Homeobox, conserved site
PfamPF021838.4E-11124157IPR003106Leucine zipper, homeobox-associated
SMARTSM003401.0E-17124167IPR003106Leucine zipper, homeobox-associated
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0003700Molecular Functiontranscription factor activity, sequence-specific DNA binding
GO:0043565Molecular Functionsequence-specific DNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 195 aa     Download sequence    Send to blast
MINPQLLGGV NSSVSSLSNT SVKRERDASS LEEEVENLET KKVVLISPKV LVHNDDDDDE  60
DVHVYGTRKK LRLTKEQSDV LEDSFKEHTT LNSKQKRDLA RRLSLRPRQV EVWFQNRRAR  120
TKLKQTEVDC EILRKCYEDL KEENRRLNKE IQELKSLKMS APFRLQLSAA TLSMCPSCER  180
STYGGSTNRI FITN*
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
16672TRKKLRL
2116124RRARTKLKQ
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankHG9755162e-52HG975516.1 Solanum lycopersicum chromosome ch04, complete genome.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_006356814.11e-103PREDICTED: homeobox-leucine zipper protein HAT22-like
SwissprotP466041e-59HAT22_ARATH; Homeobox-leucine zipper protein HAT22
TrEMBLM1A1J31e-102M1A1J3_SOLTU; Uncharacterized protein
STRINGPGSC0003DMT4000126151e-102(Solanum tuberosum)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
AsteridsOGEA29024186
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G37790.11e-56HD-ZIP family protein